Search results for "Ranking data"

showing 9 items of 9 documents

Weighted distance-based trees for ranking data

2017

Within the framework of preference rankings, the interest can lie in finding which predictors and which interactions are able to explain the observed preference structures, because preference decisions will usually depend on the characteristics of both the judges and the objects being judged. This work proposes the use of a univariate decision tree for ranking data based on the weighted distances for complete and incomplete rankings, and considers the area under the ROC curve both for pruning and model assessment. Two real and well-known datasets, the SUSHI preference data and the University ranking data, are used to display the performance of the methodology.

Statistics and ProbabilityDecision tree03 medical and health sciences0302 clinical medicine0504 sociology030225 pediatricsPreference dataStatisticsDecision treePruning (decision trees)University ranking dataDistance-based methodMathematicsWeighted distanceApplied Mathematics05 social sciencesUnivariate050401 social sciences methodsSUSHI dataComputer Science Applications1707 Computer Vision and Pattern RecognitionPreferenceComputer Science ApplicationsRankingRanking dataKemeny distanceSettore SECS-S/01 - StatisticaArea under the roc curve

researchProduct

Boosting for ranking data: an extension to item weighting

2021

Gli alberi decisionali sono una tecnica predittiva di machine learning particolarmente diffusa, utilizzata per prevedere delle variabili discrete (classificazione) o continue (regressione). Gli algoritmi alla base di queste tecniche sono intuitivi e interpretabili, ma anche instabili. Infatti, per rendere la classificazione più affidabile si `e soliti combinare l’output di più alberi. In letteratura, sono stati proposti diversi approcci per classificare ranking data attraverso gli alberi decisionali, ma nessuno di questi tiene conto ne dell’importanza, ne delle somiglianza dei singoli elementi di ogni ranking. L’obiettivo di questo articolo `e di proporre un’estensione ponderata del metodo …

boosting weighted ranking data ensemble methods decision treesSettore SECS-S/01 - Statistica

researchProduct

Ensemble methods for item-weighted label ranking: a comparison

2022

Label Ranking (LR), an emerging non-standard supervised classification problem, aims at training preference models that order a finite set of labels based on a set of predictor features. Traditional LR models regard all labels as equally important. However, in many cases, failing to predict the ranking position of a highly relevant label can be considered more severe than failing to predict a trivial one. Moreover, an efficient LR classifier should be able to take into account the similarity between the items to be ranked. Indeed, swapping two similar elements should be less penalized than swapping two dissimilar ones. The contribution of the present paper is to formulate more flexible item…

Ensemble methodsRanking dataLabel rankingSettore SECS-S/01 - Statistica

researchProduct

Recursive partitioning: an approach based on the weighted kemeny distance

2015

In the framework of preference rankings, the interest can lie in finding which predictors and which interactions are able to explain the observed preference structures. The possibility to derive consensus measures using a classification tree represents a novelty and an important tool, given its easy interpretability. This work proposes the use of a univariate decision tree for ranking data based on the weighted Kemeny distance. The performance of the methodology will be shown by using a real dataset about university rankings.

Classification tree distance based methods ranking data Kemeny distance.Settore SECS-S/01 - Statistica

researchProduct

ENSEMBLE METHODS FOR RANKING DATA

2017

The last years have seen a remarkable flowering of works about the use of decision trees for ranking data. As a matter of fact, decision trees are useful and intuitive, but they are very unstable: small perturbations bring big changes. This is the reason why it could be necessary to use more stable procedures, as ensemble methods, in order to find which predictors are able to explain the preference structure. In this work ensemble methods as BAGGING and Random Forest are proposed, from both a theoretical and computational point of view, for deriving classification trees when ranking data are observed. The advantages of these procedures are shown through an example on the SUSHI data set.

ranking data ensemble methods bagging random forestSettore SECS-S/01 - Statistica

researchProduct

DISTANCE‐BASED DECISION TREES FOR RANKING DATA: THE ROLE OF THE WEIGHT SYSTEMS

2018

In everyday life ranking and classification are basic cognitive skills that people use in order to grade everything that they experience. Grouping and ordering a set of elements is considered easy and communicative; thus, rankings of sport‐teams, universities, countries and so on are often observed. A particular case of ranking data is represented by preference data, where individuals show their preferences over a set of items. When individuals specific characteristics are available, an important issue concerns the identification of the profiles of respondents (or judges) giving the same/similar rankings. In order to incorporate respondent‐specific covariates distance‐based decision tree mo…

Ranking data weights consensusSettore SECS-S/01 - Statistica

researchProduct

Ensemble methods for ranking data with and without position weights

2020

The main goal of this Thesis is to build suitable Ensemble Methods for ranking data with weights assigned to the items’positions, in the cases of rankings with and without ties. The Thesis begins with the deﬁnition of a new rank correlation coefﬁcient, able to take into account the importance of items’position. Inspired by the rank correlation coefﬁcient, τ x , proposed by Emond and Mason (2002) for unweighted rankings and the weighted Kemeny distance proposed by García-Lapresta and Pérez-Román (2010), this work proposes τ x w , a new rank correlation coefﬁcient corresponding to the weighted Kemeny distance. The new coefﬁcient is analized analitically and empirically and represents the main…

ranking databoostingweighted Kemeny distancebaggingSettore SECS-S/01 - Statisticalinear mixed modelensemble method

researchProduct

A weighted distance-based approach with boosted decision trees for label ranking

2023

Label Ranking (LR) is an emerging non-standard supervised classification problem with practical applications in different research fields. The Label Ranking task aims at building preference models that learn to order a finite set of labels based on a set of predictor features. One of the most successful approaches to tackling the LR problem consists of using decision tree ensemble models, such as bagging, random forest, and boosting. However, these approaches, coming from the classical unweighted rank correlation measures, are not sensitive to label importance. Nevertheless, in many settings, failing to predict the ranking position of a highly relevant label should be considered more seriou…

Artificial IntelligenceDecision treesGeneral EngineeringLabel rankingWeighted ranking dataEnsemble methodBoostingComputer Science ApplicationsExpert Systems with Applications

researchProduct

New Flexible Probability Distributions for Ranking Data

2015

Recently, several models have been proposed in literature for analyzing ranks assigned by people to some object. These models summarize the liking feeling for this object, possibly also with respect to a set of explanatory variables. Some recent works have suggested the use of the Shifted Binomial and of the Inverse Hypergeometric distribution for modelling the approval rate, while mixture models have been developed for taking into account the uncertainty of the ranking process. We propose two new probabilistic models, based on the Discrete Beta and the Shifted-Beta Binomial distributions, that ensure much flexibility and allow the joint modelling of the scale (approval rate) and the shape …

Flexibility (engineering)RankingBinomial (polynomial)Computer scienceRank (computer programming)EconometricsProbability distributionScale (descriptive set theory)Discrete Beta Ranking data Shifted-Beta BinomialRanking data Discrete Beta Shifted-Beta BinomialMixture modelSettore SECS-S/01 - StatisticaHypergeometric distribution

researchProduct